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Abstract. We obtain a number of results regarding the distribution of values of 
a quadratic function / on the set ofnxn permutation matrices (identified with 
the symmetric group S n ) around its optimum (minimum or maximum). In partic- 
ular, we estimate the fraction of permutations a such that f(a) lies within a given 
neighborhood of the optimal value of /. We identify some "extreme" functions / 
(there are 4 of those for n even and 5 for n odd) such that the distribution of every 
quadratic function around its optimum is a certain "mixture" of the distributions of 
the extremes and describe a natural class of functions (which includes, for example, 
the objective function in the Traveling Salesman Problem) with a relative abundance 
of near-optimal permutations. In particular, we identify a large class of functions 
/ with the property that permutations in the vicinity of the optimal permutation 
(in the Hamming metric of S n ) tend to produce near optimal values of / (such is, 
for example, the objective function in the symmetric Traveling Salesman Problem) 
and show that for general /, just the opposite behavior may take place: an average 
permutation in the vicinity of the optimal permutation may be much worse than an 
average permutation in the whole group S n . 



1. Introduction 

The Quadratic Assignment Problem (QAP for short) is an optimization problem 
on the symmetric group S n of n\ permutations of an n-element set. The QAP is one 
of the hardest problems of combinatorial optimization, whose special cases include 
the Traveling Salesman Problem (TSP) among other interesting problems. 

Recently the QAP has been of interest to many people. An excellent survey 
of recent results is found in [5]. Despite this work, it is still extremely difficult 
to solve QAP's of size n = 20 to optimality, and the solution to a QAP of size 
n = 30 is considered noteworthy, see, for example, [1] and [4]. Moreover, it appears 
that essentially no positive approximability results for the general QAP are known, 
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although some "bad news" (non-approximability) and approximability for special 
classes have been established, see [3] and [2]. 

The goal of this paper is to study the distribution of values of the objective 
function of the QAP. We hope that our results would allow one on one hand to 
understand the behavior of the local search heuristic, and, on the other hand, 
to get guaranteed approximations to the optimum using some simple algorithms 
based on random or partial enumeration with guaranteed complexity bounds. In 
particular, we estimate how well the sample optimum from a random sample of a 
given size approximates the global optimum. 

(1.1) The Quadratic Assignment Problem. Let Mat n be the vector space 
of all real n x n matrices A = (aij), 1 < i,j < n and let S n be the set of all 
permutations a of the set {1, . . . , n}. There is an action of S n on the space Mat n by 
simultaneous permutations of rows and columns: we let cr(A) = £?, where A = (aij) 
and B = (bij), provided b a ^) a ^j) = a^ for all i,j = 1, . . . , n. One can check that 
(<jt)A = <j(tA) for any two permutations a and r. There is a standard scalar 
product on Mat n : 

n 

(A, B) = aijbij where A = (a^) and B = (bij). 

Let us fix two matrices A = (a^) and B — (bij) and let us consider a real- valued 
function / : S n — ► R defined by 

n n 

(1.1.1) f(a) = (B, a (A)) = K(i)<7U) a i3 = Yl K^-Hi^-Hj) 

i,J = l M = l 

The problem of finding a permutation a where the maximum or minimum value 
of / is attained is known as the Quadratic Assignment Problem. It is one of the 
hardest problems of Combinatorial Optimization. From now on we assume that 
n > 4. 

In this paper, we study the distribution of values of / from the optimization 
perspective: 

• How "steep" or how "flat" can the optimum of / be? 

• How many values of / lie within a given distance to the optimum? 

• When can we hope to improve the value of f(a) by modifying a slightly? 

To formulate the questions rigorously (and to answer them), we introduce the 
standard Hamming metric on the symmetric group S n . 
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(1.2) Definitions. For two permutations r, a G S n , let the distance dist(a, r) be 
the number of indices 1 < % < n where a and r disagree: 

dist(r, a) = \i : a(i) 7^ t(i)\. 

One can observe that the distance is invariant under the left and right actions of 

dist(crai, oo'i) = dist(o"i, a 2) = dist(o"icr, a" 2 a") 

for all <7i, o"2, a G S n . 

For a permutation r and an integer fc > 1, we consider the "/c-th ring" around 

r: 

t/(r, fc) = {a G S'n : dist(a, r) = k}. 

In particular, we are interested in the distribution of values of / in the set U (r, k), 
where r is an optimal permutation. 

(1.3) The generalized problem. Our approach produces essentially identical 
results for a more general problem, where we are given a 4-dimensional array C = 

: 1 < i,j,k,l < n j of n 4 real numbers and the function / is defined by 

n 

(1-3-1) /M = E 4o»or 

If = for some matrices A = (a^) and B = (bf-i), in which case we write 

C = A ® £?, we get the special case (1.1.1) we started with. 

The main idea of our approach is as follows. Let 

be the average value of / on the symmetric group and let f = f — f . Hence the 
average value of fo is and we study the distribution of values of fo around its 
maximum (the problem with minimum instead of maximum is completely similar) . 
Now, as long as the distribution of values of fo is concerned, without loss of gener- 
ality we may assume that fo attains its maximum on the identity permutation e, so 
that /o(e) > fo(cr) for all a G S n . Let us define a function g : S n — > M, which we 
call the central projection (with the term coming from the representation theory) 
of /by 

(1-4) 9(a) 



3 



It turns out that g attains its maximum on the identity permutation, that the 
average value of g on S n is and, moreover, the average values of fo and g on the 
k-th ring U(e, k) coincide for all k. In short, g captures some important information 
about the distribution of /. The set of all functions g obtained by central projection 

(1.4) from all functions fo having maximum at the identity forms a 3-dimensional 
convex polyhedral cone. We describe this cone, identifying its extreme rays (there 
are 4 for even n and 5 for odd n), which provide us with some "extreme" types of 
distribution. Hence we study the distribution of values of g, which is a much easier 
problem. Once the distribution of values of g is understood, using (1.4), we infer 
various facts about the distribution of values of /. 

We remark that it is easy to compute the average value / of / given by (1.1.1) 
or by (1.3.1). 

(1.5) Lemma. Let f : S n — > R be a function defined by 

f(a) = {B,a(A)) 
for some matrices A = (a^) and B = (bij). Let 

be the average value of f on the symmetric group S n . Let us define 



^ aij, a2 = ^2aa and 

l<ij^j<n i=l 



n 



Then 



l<i^j<n i=l 



n(n — 1) n 

Similarly, if / is a function (1.3.1) of the generalized problem, then 
1= n(n-l) E c ki + ~ E c "i- 

K^i<nl<Mi<" l<i,l<n 

We prove Lemma 1.5 in Section 6. 

(1.6) Notation. We often denote by c some positive constant whose precise value 
is not of particular importance to us. If F and G are non-negative functions of a 
positive integer n, we write F = 0(G) if F{n) < cG(n) for some c > and all 
sufficiently large n. Similarly, we write F(n) = 0(G) if F(n) > cG(n) for some 
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constant c > and all sufficiently large n. We denote by e the identity permutation 
in S n . We denote by \X\ the cardinality of a finite set X and by conv A the convex 
hull of the set A in Euclidean space. Given a function / : S n — > R, we denote by 
/ its average value on S n : 

and by 

/o = /-7 

the "shifted" function with average. Our results concern the function f . 

The paper is organized as follows. In Sections 2-5, we state our results about the 
number of near-optimal permutations. In Sections 6-11, we prove those results and 
describe certain "extreme" distributions. We give an informal preview of our results 
below. In what follows, r is an optimal permutation such that /o( r ) > fo( (J ) f° r au 
a G S n . Since we consider the shifted function, the minimization and maximization 
problems are completely similar. 

In Section 2, we consider a special case of the problem where matrix A is sym- 
metric, has constant row and column sums and a constant diagonal (of course, A 
and B are interchangeable). For example, the symmetric TSP belongs to this class. 
The interesting feature of this special case is what we call the "bullseye" distribu- 
tion of values of fo around its maximum. It turns out that the average value of fo 
over the k-th ring U(r, k) (see Definitions 1.2) around an optimal permutation r 
steadily improves as the ring contracts to r. The proof is given in Section 8. This 
is also the simplest case to analyze. It turns out that the set of all possible central 
projections g (see (1.4)) is one-dimensional. 

In Section 3, we consider a more general case of a not necessarily symmetric 
matrix A with constant row and column sums and a constant diagonal. For ex- 
ample, the asymmetric TSP belongs to this class. We call this case "pure" since 
the objective function / lacks the component that can be attributed to the Linear 
Assignment Problem. Although we don't have the bullseye distribution of Section 
2, we can provide some guarantees for the number of reasonably good permutations 
a. Thus, for any a > 1 the probability that a random permutation a G S n satisfies 

fo( <J ) > ~^fo( T ) is a t least 0(n -2 ). Furthermore, for any e > the probability that 

a random permutation a satisfies fo(cr) > n~ e fo(T) is "mildly exponential", that is 
at least of the order of exp{— n c } for some constant c = c(e) < 1. The proof is given 
in Section 9. It turns out that the set of all central projections g, maximized at the 
identity, forms a 2-dimensional cone. The extreme rays provide us with the extreme 
types of distributions, which, although not as good as the "bullseye" distribution 
of Section 2, still quite reasonable, especially compared with types of distributions 
we encounter in general symmetric QAP. 

In Section 4, we consider the symmetric Quadratic Assignment Problem, where 
matrix A (or, equivalently B) is symmetric. This case turns out to be very different 
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in many respects from the special cases of Sections 2 and 3. It turns out that the 
"bullseye" distribution is no longer the law. We present a simple example of function 
fo where the average value of fo over the k-th ring U(t, k) of an optimal permutation 
t is much worse than the average over the whole group S n even for small k. We call 
such a distribution a "spike" . We argue that at least for the generalized problem 
(1.3), the number of near-optimal permutations is much smaller than in the pure 
case of Section 3. The proofs are given in Section 10. It turns out that the set of 
all central projections (1.4) forms a 2-dimensional cone whose extreme rays provide 
us with the extreme types of distributions. One of those rays turns out to have an 
extreme "spike" distribution. 

In Section 5, we consider the general Quadratic Assignment Problem. As in 
Section 3, we prove that for any a > 1 the probability that a random permutation 

ry 

a G S n satisfies fo{&) > — jj/oC 7 ") is a t least fi(n~ 2 ), although with a worse constant 

than in Section 3. We prove that for any e > there is a constant c(e) < 1 such 
that the probability that a random permutation a satisfies fo(cr) > n _1_e /o(T) is at 
least of the order exp{— n c } (mildly exponential). The proofs are given in Section 
11. It turns out that the set of central projections (1.4) forms a 3-dimensional 
polyhedral cone with 4 extreme rays when n is even and 5 extreme rays when n is 
odd. In a sense, those extreme rays describe all "extreme" distributions that one 
may encounter in the general Quadratic Assignment Problem. 

In Section 6, we prove some preliminary technical results. In Section 7, we review 
the necessary facts from the representation theory of the symmetric group, which 
we use essentially in our approach. 



2. The Bullseye Case 



Our analysis of the Quadratic Assignment Problem is the simplest in the follow- 
ing special case (it also exhibits some features absent in the general case). Suppose 
that the matrix A = (ciij) is symmetric and has constant row and column sums and 
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a constant diagonal: 
a 



'i-J 



ciji for all 1 < i,j < n; 



for some a 

n 

ciij = a for all j = 1, . . . , n and 



i=i 

n 



ciij = a for all i = 1, . . . , n; 



For example, 



A = 



= b for some b and all i = 1, . . . , n. 
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0/ 



a ij ~ 



1 if \i — j\ = 1 mod n 
otherwise 



satisfies these properties and the corresponding optimization problem is the Sym- 
metric Traveling Salesman Problem. It turns out that the optimum has a charac- 
teristic "bullseye" feature in the Hamming metric on S n (see Definition 1.2). 

(2.1) Theorem. Suppose that the matrix A is symmetric and has constant row 
and column sums and a constant diagonal. Let f : S n — > R be the function defined 
by (1.1.1) for A and some matrix B. Let f be the average value of f on S n , let 
fo = f — f and let r G S n be an optimal permutation: fo(r) = max^g^ /o(°")- For 
k>0 let 

U(r;k) = {a : dist(a,r) = k) 
be the k-th "ring" around r and let 



ct(n, k) 



(n - k) 2 - 3(n - k) 
n 2 — 3n 



Then 



1 



\U(r-k)\ 



fo(o-)>a(n,k)fo(r). 



a€U(r;k) 

We prove Theorem 2.1 in Section 8. 



(2.2) The "bullseye" distribution. Connections with the local search. It 

follows from our proof that we have almost equality in the formula of Theorem 2.1. 
We observe that as the ring U(t; k) contracts to the optimal permutation r, the 
average value of / on the ring steadily improves. 

Distribution of values of the objective function with respect to the Hamming distance 
from the maximum point 




Maximum value Large average 



Medium average 



Small average 



Figure 1 

It is easy to construct examples where some values of / in a very small neighborhood 
of the optimum are particularly bad, but as follows from Theorem 2.1, such values 
are relatively rare. In our opinion, this provides some justification for the local 
search heuristic, where one starts from a permutation and tries to improve the value 
of the objective function by searching a small neighborhood of the current solution. 
Indeed, if we had the value of /o(c) for each a G U(t, k) equal to a(n, k)fo(r), then 
the local search would have converged to the optimum in 0(n) steps, since each step 
would have brought us to a smaller neighborhood of the optimal solution. Instead, 
we have that the average value over U (r, k) is (almost) equal to a(n, k)fo(r) . We can 
no longer guarantee that the local search converges fast (or even converges) to the 
optimal solution (after all, our problem includes the Traveling Salesman Problem 
as a special case and hence is NP-hard), but it plausible that the local search 
behaves reasonably well for an "average" optimization problem. This agrees with 
the empirical evidence that the local search works well for the Traveling Salesman 
Problem. 

Incidentally, one can prove that the same type of the "bullseye" behavior is 
observed for the Linear Assignment Problem and some other polynomially solvable 
problems, such as the weighted Matching Problem. 

Estimating the size of the ring U(t, k), we get the following result. 

(2.3) Theorem. Suppose that the matrix A is symmetric and has constant row 
and column sums and a constant diagonal. Let f : S n — ► R be the function defined 
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by (1.1.1) for A and some matrix B, let f be the average value of f on S n and 
let fo = f — f. Let t be an optimal permutation: fo(r) = max ff€ s n fo(°~)- Let us 
choose an integer 3 < k < n — 5 and a number < 7 < 1 and let 

I3(n,k) = — — ^. 

n z — 3n 

The probability that a random permutation a G S n satisfies the inequality 



fo(o-)>7P(n,k)fo(r) 

is at least 

(l-7)£(n,fc) 
3A;! 

We prove Theorem 2.3 in Section 8. 

Our results can be generalized in a quite straightforward way to functions / 
defined by (1.3.1), if we assume that for any k and / the matrix A = (ay), where 
aij = c£t, is symmetric with constant row and column sums and has a constant 
diagonal. 

3. The Pure Case 

In this Section, we consider a more general case of a not necessarily symmetric 
matrix A having constant row and column sums and a constant diagonal: 

for some a 
a for all j = 1, . . . , n and 

a for all % = 1, . . . , n; 



i=i 

n 



3=1 



an = b for some b and all i = 1, . . . , n. 



For example, matrix 



/ 




A = 



1 





1 





V 1 








\ 




1 

/ 



a ij ~ 



1 if j = i + 1 mod n 
otherwise 



satisfies these properties and the corresponding optimization problem is the Asym- 
metric Traveling Salesman Problem. 

We call this case pure, because as we remark in Sections 7 and 9, the objective 
function / lacks the component attributed to the Linear Assignment Problem. More 
generally, an arbitrary objective function / in the Quadratic Assignment Problem 
can be represented as a sum / = /i + /2, where fi is the objective function in a 
Linear Assignment Problem and fi is the objective function in some pure case. 

In this case we can no longer claim the bullseye distribution of Section 2 (the 
reasons are explained in Section 9), the distribution in this case is not as bad as, 
for example, in the general symmetric QAP (see Section 4) and the estimates of 
the number of relatively good values we are able to prove are almost as good as 
those of Section 2. 

(3.1) Theorem. Suppose that the matrix A has constant row and column sums 
and a constant diagonal. Let f : S n — ► K. be the function defined by (1.1.1) for A 
and some matrix B, let f be the average value of f on S n and let fo = f — f . Let 
r be an optimal permutation, so fo(r) = max (Te s n fo(o~). Let us choose an integer 
3 < k < n — 5 and a number < 7 < 1 and let 

P(n,k) = — — — • 
n z — 3n + 1 

The probability that a random permutation a G S n satisfies the inequality 

/o(<r)>7/*(»,*)/o(r) 

is at least 

(l- 7 )£(n,fc) 
lOfc! 

In particular, by choosing an appropriate k, we obtain the following corollary. 

(3.2) Corollary. 

(1) Let us fix any a > 1. Then there exists a 5 = 5(a) > such that for all 
sufficiently large n > N(a) the probability that a random permutation a in 
S n satisfies the inequality 

fo(a) > £/ (r) 

is at least 5n~ 2 . In particular, one can choose 5 = exp{— c\fa lna} for 
some absolute constant c > 0. 

(2) Let us fix any e > 0. Then there exists a 5 = 5(e) < 1 such that for all 
sufficiently large n > N(a) the probability that a random permutation a in 
S n satisfies the inequality 

/oW > n^f (r) 



is at least exp{— n s }. In particular, one can choose any 5 > 1 — e/2. 
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□ 

We prove Theorem 3.1 in Section 9. 

From Corollary 3.2, it follows that to get a permutation a which satisfies (1) 
for any fixed a, we can use the following straightforward randomized algorithm: 
sample 0(n 2 ) random permutations a G S n , compute the value of / and choose 
the best permutation. With the probability which tends to 1 as n — > +oo, we 
will hit the right permutation. The complexity of the algorithm is quadratic in n 
for any a, but the coefficient of n 2 grows as a grows. If we are willing to settle 
for an algorithm of a mildly exponential complexity of the type exp{n^} for some 
(3 < 1 we can achieve a better approximation (2) by searching through the set of 
randomly selected expjn^} permutations. We remark that no algorithm solving 
the Quadratic Assignment Problem (even in the special case considered in this 
section) with an exponential in n complexity exp{(9(n)} is known, although there 
is a dynamic programming algorithm solving the Traveling Salesman Problem in 
exp{(9(n)} time. 

Again, our results can be generalized in a quite straightforward way to functions 
/ defined by (1.3.1), if we assume that for any k and / the matrix A = (ay), where 
ciij = cj^ has constant row and column sums and has a constant diagonal. 

4. The Symmetric Case 
In this section, we assume that the matrix A = (a^) is symmetric, that is 

ciij = ciji for all 1 < i, j < n. 

Overall, the distribution of values of / turns out to be much more complicated when 
in the special cases described in Sections 2 and 3. First, we observe that generally 
one can not hope for the "bullseye" feature described in Section 2.2. 

(4.1) The "spike" distribution. Let us choose an nxn matrix A = (a y ), where 



CLij — 



1 if(y) = (12) or (y) = (21) 
otherwise, 



so 



A = 



( 
1 





1 





V 







\ 




J 



Let 



7 = 



-n 2 + 5n - 8 

8(n- 2) 
ll 



and let B = {bij), where 



'13 



if i = j 

7 if i < 2 and j > 3 or if j < 2 and i > 3 
1/2 otherwise, 



so 



B = 






1/2 
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7 


... 7 \ 


1/2 
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7 





1/2 


... 1/2 
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1/2 





1/2 
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1/2 




1/2 / 



Let / : S n — > R be the function defined by (1.1.1). In Section 10, we prove the 
following properties of /. 



• We have / = for the average value of / on S n ; 

• The maximum value of / on S n is 1 and is attained, in particular, on the 
identity permutation e; 

• For the k-th ring U (e, k) centered at the identity permutation e, we have 



1 „. . —nk + k 2 + 3n - k - 4 

2^ /(*) < — 



\U(e,k)\ 



aEU(e,k) 



2n-4 



We observe that already for k = 4 (a more careful analysis yeilds k = 3) the 
average value of / over U(e, k) is negative for all sufficiently large n. Thus an 
average permutation in U (e, 4) presents us with a choice worse than an average 
permutation in S n . The distribution of values of / turns out to be of the opposite 
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nature to the bullseye distribution of Figure 1. We call it the "spike" distribution. 

Distribution of values of the objective function with respect to the Hamming distance 
from the maximum point 




Figure 2 

Of course, in this particular case the optimization problem is very easy since the 
function / attains only two different values. However, this may serve as an indi- 
cation that complicated distributions are indeed possible and the local search may 
not work well for a general symmetric QAP. Indeed, this is the case if we allow 
generalized functions (1.3.1). 

In Section 10, we show that there exists a tensor with the property that 
c% ki = c ki f° r au ^ an d I and all i and j such that for the corresponding function / 
defined by (1.3.1), we have 

( , . M = -nfrfr) ~ 1) + Pfr)(p(g) + 1) + 2t(a) - 4 

[ ' ' } I[a) 272 -4 

where p(u) = |{i : a(i) = i}\ is the number of fixed points of the permutation 
and t(a) = \{i < j '■ cr(z) = j and a(j) = i}\ is the number of 2-cycles in the 
permutation. We show that / = and that the maximum value 1 of / is attained 
at the identity permutation e (where p(cr) — n and t(a) = 0) and, for even n, on the 
permutations that consist of n/2 transpositions (where p(a) = and t(a) = n/2). 
On the other hand, for any fixed n — 3 > k > 3 and all n > 5, the value of f(a) 
with a E U(e, k) is negative. 

(4.2) Scarcity of relatively good values. Unfortunately, we are unable to 
present an example of the symmetric QAP which beats the bound of Theorem 
3.1 but we can construct such an example for the generalized problem (1.3). In 
Section 10, we prove that for any 3 < m < n, there exists a tensor such that 
c kl = c ki f° r au ^ and / and such that for the corresponding function / we have 

( 421 ) ^ = pH°) - mp{(j) + 2t(a) + m - 3 



n 2 — nm + m 
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where p(a) is the number of fixed points in a and t(a) is the number of 2-cycles in 
a. We show that / = and that /(e) = 1 is the maximum value of /. 

Let us fix any < 5 < 1 and let us choose some m such that n 1_e > m > n 5 for 
some e > 0. Then, for all sufficiently large n, the value f(a) > 2/n can be achieved 
only on permutations a with p(a) > m. The number of such permutations a 

does not exceed ( U ) (n — m)\ = nl/ml, that is, the probability that a random 
\m J 

permutation a satisfies f(a) > 2/n does not exceed exp{— n 5 } for large n. 

5. The General Case 

It appears that the difference between the general case and the symmetric case 
of Section 4 is not as substantial as the difference between the symmetric case and 
the special cases of Sections 2 and 3. Our main result is: 

(5.1) Theorem. Let f : S n — >■ K. be the function defined by (1.1.1) or (1.3.1), 
let f be the average value of f on S n and let fo = f — f . Let r be an optimal 
permutation: /o(t) = max (Te s n fo(o~). Let us choose an integer 3 < k < n — 5 and 
a number < 7 < 1. Let 

k — 2 

fc ) = ~ r— T — o- 

n z — nk + k — 2 

The probability that a random permutation a G S n satisfies 

/o(<t)>7/?(M)/o(t) 

is at least 

5A;! 

In particular, by choosing an appropriate /c, we obtain the following corollary. 

(5.2) Corollary. 

(1) Let us fix any a > 1. Then there exists a 5 = 6(a) > such that for all 
sufficiently large n > N(a) the probability that a random permutation a in 
S n satisfies the inequality 

fo(a) > 4/o(r) 

is at least 5n~ 2 . In particular, one can choose 5 = exp{— calna} for some 
absolute constant c > 0. 

(2) Let us fix any e > 0. Then there exists a 5 = 5(e) < 1 such that for all 
sufficiently large n > N(e) the probability that a random permutation a in 
S n satisfies the inequality 

/oW > n-^foir) 

is at least exp{— n s }. In particular, one can choose any 5 > 1 — e. 
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□ 

We prove Theorem 5.1 in Section 11. As in Section 2, we conclude that for any 
fixed a > 1 there is a randomized 0(n 2 ) algorithm which produces a permutation a 
satisfying (1). If are willing to settle for an algorithm of mildly exponential complex- 
ity, we can achieve the bound of type (2), which is weaker than the corresponding 
bound of Corollary 3.2. 

In Section 11, we construct an example of a function of type (1.3.1) with an even 
sharper spike distribution than in example 4.1. 



6. Preliminaries 

First, we prove Lemma 1.5. 

Proof of Lemma 1.5. Let us choose a pair of indices 1 < % ^ j < n. Then, as a 
ranges over the symmetric group S n , the ordered pair (a(i), ^(j)) ranges over all 
ordered pairs (/c, /) with 1 < k ^ I < n and each such a pair (/c, /) appears (n — 2)! 
times. Similarly, for each index 1 < i < n, the index a{i) ranges over the set 
{1, . . . , n} and each j e {1, . . . , n} appears (n — 1)! times. Therefore, 

n n 



' &es n i,j=i i,j=i ' <yes n 

n(n — 1 ) n ' n(n — 1 n 

iytj 1 = 1 



and the proof follows. □ 

Suppose that f(a) = (B,a(A)} for some matrices A and B and all a £ S n and 
suppose that the maximum value of / is attained at a permutation r. Let A\ = t(A) 
and let fi(o~) = (B,a(Ai)). Then fi(a) = /(err), hence the maximum value of f\ 
is attained at the identity permutation e and the distribution of values of / and 
fi is the same. We observe that if A is symmetric then A\ is also symmetric, and 
if A has constant row and column sums and a constant diagonal then so does Ai 
(see also Section 7). Hence, as long as the distribution of values of / is concerned, 
without loss of generality we may assume that the maximum of / is attained at the 
identity permutation e. 

M be a function. Let us define function g : 

We call g the central projection of /. 

The following simple observation is quite important for our approach. 
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(6.1) Definition. Let / : S r , 

S n — >■ R by 



(6.2) Lemma, let f : S n — ► K. be a function such that /(e) > f(a) for all a G S n 
and let g be the central projection of f . Then g(e) = /(e) > g(a) for all a G S n 
and the average values of f and g are equal: f = g. 

Proof. We observe that u~ 1 euj = e for all u G S n and hence g(e) = /(e). Moreover, 
for any a G S n 



Finally, 

= i [ E7 = 7 

n! 



and the proof follows. □ 

Moreover, one can observe that the averages of / and g on the k-th ring U(e, k) 
coincide for all k = 0, . . . , n, see Definition 1.2. 

We will rely on a Markov type estimate, which asserts, roughly, that a function 
with a sufficiently large average takes sufficiently large values sufficiently often. 

(6.3) Lemma. Let X be a finite set and let f : X — > R be a function. Suppose 
that f(x)<l for all x G X and that 



— — ~S~] f{x) > (3 for some (3 > 0. 
\X J 

Then for any < 7 < 1 we have 

\{xeX:f{x)>f3 1 }\>f3{l- 1 )\X\ 

Proof. We have 



</3 7 + 
Hence 



^]Y\^ m= \xl £ f^ + Txl £ /(*) 

{x:/(x)>/3 7 }| 



|{x:/(x)>)87}| >/?(!- 7)|*| 



□ 



Finally, we need some facts about the structure of the symmetric group S n (see, 
for example, [6]). 
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(6.4) The conjugacy classes of 5, 



Let us fix a permutation p G S n . As w ranges over the symmetric group S n , the 
permutation uj~ x pu ranges over the conjugacy class of X{p) of p, that is the set of 
permutations that have the same cycle structure as p. 

We will be using the following facts. 

(6.4.1) Central projections and conjugacy classes. If / : S n — > K is a 

function and g : S n — ► R its central projection, then 

9 ^=\xJpj-\ £ /(a) - 

1 KfJJl aeX(p) 

If X C S n is a set which splits into a union of conjugacy classes X(pi) : i G /, and 
for each such a class we have 



l*(Pi), =V( 



E /(^ 



for some number a, then 



,4 E / w ^ «• 

(6.4.2) Permutations with no fixed points and 2-cycles. Let us fix some 
positive integers Cj : % = 1, . . . , m and let a n be the number of permutations in S n 
that have no cycles of length a for 1 < % < m. The exponential generating function 
for a n is given by 

oo m r . 

Ea n 1 r ^ x Cl 

—x = exp< — > — 
nl 1-x I ^-f q 

n=0 i=l 

where we agree that ao = 1, see, for example, pp. 170-173 of [7]. It follows that 
the number of permutations a G S n without fixed points is asymptotically e~ l n\ 
and without fixed points and 2-cycles is e _3 / 2 n!. We will use that the first number 
exceeds n!/3 and the second number exceeds n\/5 for n > 5. 

(6.4.3) Permutations with many fixed points and 2-cycles. The number of 
permutations a G S n with at least k fixed points is at most n\/k\, since to choose 

such a permutation, we can first choose k fixed points in ways and then choose 

an arbitrary permutation of the remaining (n — k) elements in (n — k)\ ways (some 
permutations will be counted several times) . Similarly, the number of permutations 

nl 

a G S n with at least k transpositions (2-cycles) is at most jj^i since to choose 

77-1 

such a permutation, we first choose some k pairs in , ways and then 

an arbitrary permutation of the remaining n — 2k elements in (n — 2k)\ ways (again, 
some permutations will be counted several times). 
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7. Action of the Symmetric Group in the Space of Matrices 



The crucial observation for our approach is that the vector space of all central 
projections g of functions / defined by (1.1.1) or (1.3.1) is 4-, 3-, or 2- dimensional 
depending on whether we consider the general case, the cases of Sections 3 and 
4 or the special case of Section 2. If we require, additionally, that / = then 
the dimensions drop by 1 to 3, 2 and 1, respectively. This fact is explained by 
the representation theory of the symmetric group (see, for example, [6]). In this 
section, we review some facts that we need. Our notation is inspired by the generally 
accepted notation of the representation theory. 

We describe some important invariant subspaces of the action of S n in the space 
of n x n matrices Mat n by simultaneous permutations of rows and columns. We 
recall that n > 4. 

(7.1) Subspace L n . Let L\ be the space of constant matrices A: 

ciij = a for some a and all 1 < i,j < n. 

Let l? n be the subspace of scalar matrices A: 

a if i = j 
if i^j 



ciij 



for some a. 



Finally, Let L n = L\ + L 2 n . One can observe that dimL n = 2 and that L n is the 
subspace of all matrices that remain fixed under the action of S n . 

(7.2) Subspace L n _i i. Let L\_ x 1 be the subspace of matrices with identical 
rows and such that the sum of entries in each row is 0: 



A = 



« 2 

a 2 



\ 



\ 



where ol\ + . . . + a n = 0. 



«1 «2 



a 



J 



Similarly, let L 2 n _ x l be the subspace of matrices with identical columns and such 
that the sum of entries in each column is 0: 





/ «i 




■ Oil \ 


A = 










V OL n 


OL n ■ 





where a\ + . . . + a n = 0. 



Finally, let Lf l _ 11 be the subspace of diagonal matrices with the zero sum on the 
diagonal: 





( ai 


.. 


. 


\ 


A = 





«2 


. 







V o 


.. 


. 


a n 1 



, where ol\ + . . . + a n = 0. 
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Let L n _i jl = ! + ^n-1,1 + ^n-1,1- O ne can cnec k that the dimension of each 
of L\_ x l5 i and L^_ 1:1 is n — 1 and that dimL n _i i = 3n — 3. Moreover, the 
subspaces i, -k^-i l an< ^ ^n-i l do n °t contain non-trivial invariant subspaces. 
The action of S n in L n _i i, although non-trivial, is not very complicated. One can 
show that if A e £ n -i,i + -^n, then the problem of optimizing /(a) defined by 
(1.1.1) reduces to the Linear Assignment Problem. 

(7.3) Subspace L n _ 2)2 . Let us define L n _ 2)2 as the subspace of all symmetric 
matrices A with row and column sums equal to and zero diagonal 

dij = ciji for all 1 < i,j < n; 

n 

dij = for all j = 1, . . . , n; 

i=i 

n 

dij = for all i = 1, . . . , n and 

da =0 for all i = 1, . . . , n. 

One can check that L n _ 2j2 is an invariant subspace and that dimL n _ 22 = (n 2 — 
3n)/2. Besides, L n -2,2 contains no non-trivial invariant subspaces. 

(7.4) Subspace L n _ 2iljl . Let us define L n _ 2iljl as the subset of all skew symmet- 
ric matrices A with row and column sums equal to 0: 

dij = —dji for all 1 < i,j < n; 

n 

dij = for all j = 1, ... ,ra and 

n 

a^j = for all % = 1, . . . , n. 

One can check that L n _ 2 ,i,i is an invariant subspace and that dimL n _ 2 i i = 
(n 2 — 3n)/2 + 1. Similarly, L n _ 2 ,i,i contains no non-trivial invariant subspaces. 

One can check that Mat n = L n + L n _i ; i + L n _ 2j2 + L n -2,i,i- The importance 
of the subspaces (7.1)-(7.4) is explained by the fact that they are the isotypical 
components of the irreducible representations of the symmetric group in the space 
of matrices. The following proposition follows from the representation theory of 
the symmetric group [6]. 

(7.5) Proposition. For annxn matrices A and B, where n > 4, let f : S n — > R 

be the function defined by (1.1) and let g : S n — ► R, 
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be the central projection of f . Given a permutation a G S n , let 

p(a) = \{i '■ cr(i) = i}\ and t(a) = \{i < j '■ o~(i) = j and cr(j') = i}\ 

be the number of fixed points of the permutation and the number of 2-cycles in the 
permutation correspondingly. 

(1) If A e L n then g is a scalar multiple of the constant function 

Xn (o-) = l for all a G S n ; 

(2) If A G L n _\ \ then g is a scalar multiple of the function 

Xn-i,i( a ) = P( a ) ~ 1 f° r al1 a e S n, 

(3) If A G L n _ 2j 2 then g is a scalar multiple of the function 

1 3 

Xn-2,2(» = t(a) + 2^ 2 ( a ) - aU a E Sn; 

(4) If A G Ln-2,i,i then g is a scalar multiple of the function 

1 3 

Xn-i,i,i{(j) = -P 2 {(j) - ~p((y) ~t(a) + 1 for all a G S n . 



The functions % n , Xn-i,i, Xn-2,2 and Xn-1,1,1 are the characters of correspond- 
ing irreducible representations of S n for n > 4. They are linearly independent, 
and, moreover orthogonal: J2 a eS n Xi( a )Xj i* 7 ) = for two characters of different 
irreducible representation of S n . In particular, 

Xn-l,l(<7) = Y Xn-2,2^) = Yl Xn-2,l,l(<T) = 0, 
crESn <y(zS n <y(zS n 

hence the average value of all but the trivial character x-n is 0. 

(7.6) Remark. It follows [6] that each of the functions Xm Xn-i,i? Xn-2,2 and 
Xn-2,1,1 is the objective function (1.3.1) in some generalized problem with a tensor 
c£ t (see Section 1.3) with the property that for all k and I the matrix A = (a^) 
for aij = belongs to the corresponding subspace. Since the set of all func- 
tions (1.3.1) is closed under linear combinations, it follows that every function 

/ G span{x n? Xn— 1,1? Xn— 2,2? Xn — 2,1, 

1} is an objective function in the generalized 

problem. 
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8. The Bullseye Case. Proofs 



In this section, we prove Theorem 2.1 and Theorem 2.3. An important observa- 
tion is that A satisfies the conditions of Section 2 if and only if A G L n + L n _ 2 ,2 
(see Section 7). 

Proof of Theorem 2.1. Without loss of generality, we may assume that the maxi- 
mum of fo(o~) is attained at the identity permutation e (see Section 6). Excluding 
the non-interesting case of fo = 0, by scaling /, if necessary, we can assume that 
fo(e) = 1. Let g be the central projection of fo. Then by Lemma 6.2, g = and 
1 = g(e) > g(a) for all o G S n . Moreover, since A G L n + L n _ 2 ,2, by Parts 1 and 
3 of Proposition 7.5, g must be a linear combination of the constant function Xn 
and Xn-2,2- Since g = 0, g should be proportional to Xn-2,2 and since g(e) = 1, we 
have 

2 _ 2t + p 2 - Sp 

9 9 7, Xn— 2,2 5 ~ • 

Now a G U(e, k) if and only if p(a) = n — k. Hence g(a) > a(n, k) for all a G U (e, k). 
The set U(e 7 k) splits into disjoint union of conjugacy classes X(p) and, using 
(6.4.1), we conclude that for each such X(p) 

9 ^ = TxJo)\ ^ /o(<r) >a(™,fc) 

1 Kpn aeX{p) 



and, therefore, 



fo(cr) > a(n,k), 



\U(n,k)\ ^ 

hence the proof follows. □ 

Using estimates of (6.4.2), one can show that the input of the number of 2-cycles 
t(a) into the average of fo over U(e, k) is asymptotically negligible, so there is an 
"almost equality" in the formula of Theorem 2.1. 

By estimating the cardinality of the k-th ring U(t, k), we deduce Theorem 2.3. 

Proof of Theorem 2.3. As in the proof of Theorem 2.1, we assume that the maxi- 
mum value of fo is equal to 1. 

Let us estimate the cardinality \U(r, n — k)\ = \U(e,n — k)\. Since a G U(e, n — k) 
if and only if a has k fixed points, to choose a a G U(e,n — k) one has to choose k 

points in \ wavs an d then choose a permutation of the remaining n — k points 

without fixed points. Using (6.4.2), we get 

\U(r,n-k)\>(^j(n-k)\/3=^. 
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Applying Lemma 6.3 with (3 = P(n, k) and X = U(r, n — k), from Theorem 2.1, we 
conclude that 



P{^S n :/o(<T)>7%.fe)}> 



(l--y)P(n,k)\U(T,n-k)\ 



> 



i v. 

(l- 7 )/3(n,fc) 
3fc! 



□ 



9. The Pure Case 



Proofs 



In this case, A <E L n + £ n -2,i,i + L n _ 22 (see Section 7). As in Section 8, 
the L n component contributes just a constant to /. Since the L n _\ \ component 
attributed to the Linear Assignment Problem (see Section 7.2) is absent, we call 
this case "pure". 

We choose a more convenient basis g\ and g 2 in the vector space spanned by 
Xn-2,2 and x n -2,i,i, namely: 

9i = Xn-2,2 + Xn-2,1,1 =2? 2 -3p + l and g 2 = Xn-2,1,1 - Xn-2,2 = 1 - 2t. 

(9.1) Definition. Let K p (where p stands for "pure") be the set of all functions 
g : S n — ► K. such that g G span{(7i, g 2 }, where g± = p 2 — 3p + 1 and g 2 = 1 — 2t 
and g(e) > g(a) for all a G S n , where e is the identity permutation. We call K p 
the central cone. 

Identifying spanjgi, g 2 } with two-dimensional vector space M. 2 (plane), we see 
that the conditions g(e) > g(a) define the central cone K as a convex cone in M?. 
Our goal is to find the extreme rays r\ and r 2 of K, so that every function g G K 
can be written as a non- negative linear combination of r± and r 2 . 

First, we prove a useful technical result. 

(9.2) Lemma. For a permutation a G S n , a 7^ e, let a a G M. 2 be the point 




Let P = convjacr : a 7^ e) 6e t/ie convex hull of all such points a a . 
If n is even, the extreme points of P are 



(0,0), (n-3,0), (n-2,1) and (0,1). 



// n is odd, the extreme points of P are 



(0,0), (n-3,0), (n-2,1), (0, (n-3)/n) and (1,1). 



22 



Proof. The set of all possible values (p(a) , t(a)) , where a ^ e, consists of all pairs 
of non-negative integers (p, t) such that p < n — 2, It < n and, additionally, 
p + 2t < n — 3 or p + 2t = n. To find the extreme points of the set of feasible 
points (p, 2tj (n — p)) , we choose a generic vector (71, 72) and investigate for which 
values of p and t the maximum of 

2t 

I1P + I2- 



n — p 



is attained. 

Clearly, we can assume that 72 7^ 0. If 72 < then we should choose the smallest 
possible t which would be t = unless p = n — 2 when we have to choose t = 1. 
Depending on the sign of 71, this produces the following pairs 

(p,*) = {(0,0), (n-3,0), (n-2,1)}. 

If 72 > then the largest possible value of 2t/(n — p) is 1. If 71 > this produces 
the (already included) point 

(p,t) = (n-2,1). 

If 71 < we get 

(p, t) = (0, n/2) for even n 

and 

(p,t) = |(0, (n-3)/2),(l,(n-l)/2)| for odd n. 
Summarizing, the extreme points of P are 

(0,0), (n-3,0), (n-2,1), (0,1) for even n 

and 

(0,0), (n-3,0), (n-2,1), (0, (n - 3)/n) , (1,1) for odd n 

as claimed. □ 

Now we describe the central cone K p . 
(9.3) Lemma. For n > 4 let us define the functions r\, r2 e and r2o : S n — ► R by 

n = 1 - 2t, 

p 2 - 3p - n - 6t + 2tn + 4 

r 2e = 2 — ^"n and 

n z — 4n + 4 

_ p 2 - 3p - n - At + 2tn + 3 

n 2 — 4n + 3 
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Then 



(1) If n is even then K p is a 2- dimensional convex cone with the extreme rays 
spanned by r\ and r 2e ; 

(2) If n is odd then K p is a 2-dimensional convex cone with the extreme rays 
spanned by r\ and r 2o . Cone K p contains ri e ; 

(3) If e G S n is the identity, then 

n(e) = r 2e (e) = r 2o (e) = 1. 



Proof. A function g G K p can be written as a linear combination g = a.\g\ + a 2 g 2 . 
Since p(e) = n and t(e) = 0, we have g(e) = ai(n 2 — 3n + 1) + a 2- Therefore, the 
inequalities g(e) > g(a) can be written as 

«i(n 2 - 3n + 1) + a 2 > ai(p(a) 2 - 3p(a) + l) +a 2 (l - 2t(a)), 

which, for g ^ e is equivalent to 

ai n + p a -3) +a 2 > °- 

' n—p(a) 

Using Lemma 9.2, we conclude that for even n, the system is equivalent to 

ai > 

(9.3.1) 

v ' (n - 3)ai + a 2 > 

and for odd n, the system is equivalent to 

on > 

(9.3.2) 

v ; (n - 2)ai +a 2 > 0. 

Consequently, every solution (ai,ct2) of (9.3.1) is a non-negative linear combina- 
tion of (0, 1) and (1,3 — n) and every solution of (9.3.2) is a non-negative linear 
combination of (0, 1) and (1, 2 — n). 

The functions ri, r 2e and r 2o are obtained from g 2 , gi + (3—n)g 2 and gi + (2 — n)g 2 
respectively by scaling so that the value at the identity becomes equal to 1. 

Since every solution of (9.3.1) is a solution of (9.3.2), we conclude that r 2e G K p 
for odd n as well. □ 

(9.4) Remark. If n is even, then r 2o K p . Indeed, if a is a product of n/2 
commuting transpositions, so that p(a) = and t(a) = n/2, then r 2o (a) = (n 2 — 
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3n + 3)/(n 2 - An + 3) > 1 = r 2o (e). 



The central (pure) cone 





2 



n is even 



n is odd 



Figure 3 



The functions r 2o and r 2e have the bullseye distribution of Section 2. The distri- 
bution type of T\ may be characterized as that of a "damped oscillator" with the 
averages over the /c-ring U(e,k) changing sign and going fast to as k grows. Hence 
a typical function from the central cone has a "weak" bullseye type distribution, 
which becomes weaker as the function becomes closer to r\. 

(9.5) Lemma. Let g G K p be a function such that g(e) = 1. For any 1 < k < 

n — 2, let o~k be a permutation such that p(crfc) = k and t{a^) = and let 9k be a 
permutation such that p(8k) = k and t(6k) = 1. Then 

max{g(<T k ),g(O k )} > n2 _ 3n + r 



Proof. Applying Lemma 9.3, we may assume that g is a convex combination of r\ 
and r 2 o, hence g = ol\T\ + a 2 r 2o , for some «i, a 2 > and a± + a 2 = 1. Then 

/c 2 - 3fc - n + 3 

y(o-fc) = ai + a 2 - 2 

— 4n + 3 

and 

k 2 - 3k + n - 1 

#(0 fc = -ai + a 2 2 , , o • 

rr — 4n + 3 

We observe that if ol\ = 1 and a 2 = then g{cr k ) > g(&k) and if ol\ = and o; 2 = 1 
then <7(<7fc) < g(0k)- Moreover, as («i, a 2 ) change from (1, 0) to (0, 1) function g{ak) 
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decreases and function g(0k) increases. Hence the minimum of max{(7(cr fc ), g(9k)} 
is attained when g(<Jk) = g(9k)- This produces the system of linear equations 



k 2 - 3k - n + 3 k 2 - 3k + n - 1 

ai + " 2 2 a 7 q = + tt2 2 /i T q 

n 2 — An + 3 n l — An + 3 

and 

«1 + «2 = 1 

with the solution 

n-2 _ n 2 - 4n + 3 

1 n 2 — 3n + 1 2 n 2 — 3n + 1 

The corresponding value of g(0k) = g{&k) is 

k 2 - 3k + 1 
n 2 — 3n + 1 ' 

which completes the proof. □ 

Now we are ready to prove Theorem 3.1. 

Proof of Theorem 3.1. Without loss of generality, we may assume that the maxi- 
mum value of fo is attained at the identity permutation e (see Section 6). Excluding 
an obvious case of fo = 0, by scaling /, if necessary, we may assume that fo(e) = 1. 
Let g be the central projection of fo. By Lemma 6.2, g(e) = fo(e) = 1 > g(a) for all 
a E S n and g = 0. Moreover, since A e L n + L n _ 2 ,2 +£n-2,i,i) by Proposition 7.5, 
g must be a linear combination of the constant function Xn and functions Xn-2,2 
and Xn-2,i,i- Since g = 0, g is a linear combination of x n _2,2 and Xn-2,1,1 alone. 
Therefore, g lies in the central cone: g G K p , see Definition 9.1. 

Let us choose a 3 < k < n — 5 and let be the set of permutations cr such 
that p(a) = k and t(cr) = and let Y k be the set of permutations 9 such that 
p(9) = k and t{9) = 1. To choose a permutation a e X*., one has to choose fixed 

points in ( ^ ) ways and then a permutation without fixed points or 2-cycles on the 



remaining (n — k) points. Then, by (6.4.2) 

Similarly, to choose a permutation 9 G Yfc, one has to choose a 2-cycle in ways, 

fixed points in ^ ^ ^ ways and a permutation without fixed points or 2-cycles 
on the remaining (n — k — 2) points. Then, by (6.4.2) 

\Y k \>-(\[ , (n-fe-2)! 



5V2/V k ) y ' 10k\ 
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Let us choose a permutation a G and a permutation 6 G Yj~ and let Z = if 
g(&k) > fi'(^fc) and Z = Y k otherwise. Then 



Z > 



lOJfe! 



and by Lemma 9.5, 



^ n 2 -3n+l f0raU 



The set Z is a disjoint union of some conjugacy classes X(p) and for each X(p) by 
(6.4.1), we have 



1 ^ fc 2 - 3fc + 1 



<r€X(p) 



and hence 

1 V- t i \ ^ k 2 — 3k + 1 
\Z\ ^-^ n 2 — 3n + 1 

Applying Lemma 6.3 with X = Z and (3 = (3(n, k), we get that 



> {aeS n :f (a)> 7 (3(n,k)}> 



(l- 7 )/3(n,fc) 
lOfc! 



10. The Symmetric Case. Proofs 

In this case, A G L n + L n _\ \ + L n _ 2 ,2 (see Section 7). As in Sections 8 and 9, 
the L n component contributes a just a constant to /. We choose a more convenient 
basis <7i and g 2 in the vector space spanned by Xn-i,i and Xn-2,2, namely 

9i = Xn-1,1 =p-l and g 2 = IXn-2,2 + ^Xn-1,1 = P 2 + 2t - 3, 

where p(a) is the number of fixed points of a and t(a) is the number of 2-cycles in 
a. 

(10.1) Definition. Let K s ( where s stands for "symmetric") be the set of all 
functions g : S n — > R such that g G span{gi, g 2 }, where g\ = p — 1 and g 2 = 
p 2 + 2t — 3 and g(e) > g(cr) for all a G S n , where e is the identity permutation. We 
call K the central cone. 

Identifying spanjgi, g 2 } with two-dimensional vector space R 2 (plane), we see 
that the conditions g(e) > g(a) define the central cone K s as a convex cone in 
R 2 . Our immediate goal is to find the extreme rays r± and r 2 of K s , so that every 
function g G K s can be written as a non-negative linear combination of r\ and r 2 . 
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(10.2) Lemma. For n > 4 let us define the functions ri,r 2e and r 2o '■ S r , 
by 

_ 2np - 2n - p 2 - 3p - It + 6 

n 2 — 5n + 6 

-np + n+p 2 +p + 2t-4 

r 2e = and 

2n — 4 

— n 2 p + np 2 + n 2 + np + 2nt — An — 3p + 3 
T2 ° = 2n 2 - 7n + 3 ' 



Then 



(1) If n is even then K s is a 2- dimensional convex cone with the extreme rays 
spanned by r\ and r 2e ; 

(2) If n is odd then K s is a 2-dimensional convex cone with the extreme rays 
spanned by r\ and r 2o - Cone K s contains r 2e ; 

(3) If e G S n is the identity, then 

ri(e) = r 2e (e) = r 2o (e) = 1. 



Proof. A function g G K s can be written as a linear combination g = a±gi + a 2 g 2 - 
Since p(e) = n and t(e) = 0, we have g(e) = a\(n — 1) + a 2 (n 2 — 3). Therefore, the 
inequalities g(e) > g(o~) can be written as 

ai(n- 1) + a 2 (n 2 -3) > «i(p(a) - l) + a 2 (p 2 (a) + 2t(a) - 3), 

which, for a ^ e, is equivalent to 

(10.2.1) ai + a 2 (n + p(a) ^) . ) > 0. 

V n — p(a)/ 

Applying Lemma 9.2, we observe that (10.2.1) is equivalent to the system of two 
inequalities: 

cti + (2n - 3)a 2 > 

and 

ai + (n — l)a 2 > if n is even, 

na\ + (n 2 — n + 3)a 2 > if n is odd. 

Thus every pair (a\, a 2 ) satisfying (10.2.1) can be written as a non-negative linear 
combination of (2n — 3, —1) and (1 — n, 1) when n is even and (2n — 3, —1) and 
(— n 2 + n — 3, n) when n is odd. 

The generators ri, r 2e and r 2o are obtained from (2n — 3)gi — g 2 , (1 — n)gi + g 2 
and (— n 2 + n — 3)g± + ng 2 respectively by scaling so that the value at the identity 
becomes equal to 1. 
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It remains to check that r 2e G K for n odd as well. Indeed, using that 2t + p < n 
we have 

(2n - 4) (r 2e - l) = -n(p - 1) + p(p + 1) + 2* - 4 - 2n + 4 

= -n(p + 1) + p(p + 1) + 2t(p + l)(-n + p) + 2* 
< (p + 1) (— n + p) + n — p = p(—n + p) < 0. 

□ 

(10.3) Remark. The average value of ri, r2 e and r2 on SVi is 0. 

The function r± : S n — >■ R provides an example of the "bullseye" distribution 
(see Section 2.2). The maximum value of 1 is attained at the identity and at any 
transposition. The positive values of r\ occur on permutations with at least two 
fixed points and r±(a) = 0(p(a)/n) ifp(c) > 3. 

In contrast, r 2e and r 2o exhibit a spike type distribution of Section 4.1. The 
maximum value of 1 is attained at the identity and, for r2 e , on the product of n/2 
transpositions, or, for r2o, on the product of (n — 3)/2 transpositions. On the other 
hand, no permutation other than e with at least 2 fixed points yields a positive 
value. 

One can observe that if n is even then r 2o K. Indeed, if a is a product of n/2 
transpositions then r 2o (a) = (2n 2 - An + 3)/(2n 2 - 7n + 3) > 1. 




n even n odd 

Figure 4 



The picture of K s is very similar to that of K p , see Section 9.4. 

(10.4) Remark. The spike distribution. Let us consider Example 4.1. It is seen that 
f(o~) = 26 cr (i) cr ( 2 ) and hence the maximum value of / is indeed 1 and obtained, in 
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particular, on the identity permutation e. Applying Lemma 1.5, we get 

j = l + 4 7 (n-2)+0.5(n-2)(n-3) = Q 
n(n — 1) 

Let us prove that the central projection of / is the function r 2e of Lemma 10.2. 

Suppose that g is the central projection of /. It follows that g can be written as 
a linear combination g = a\r\ + a 2 r 2e . Since g(e) = ri(e) = r 2e (e) = 1, we must 
have ai+a 2 = 1. Let 9 = (12) be a transposition, hence p(9) = n — 2 and t(9) = 1. 
Then ri(6>) = 1 and r 2e (9) = 0, hence «i = g(9). 

Denoting by X the set of all transpositions in S n , by (6.4.1) we get 

^)=n^ £/(-) = 2(f) EWVP) 

=6r(^( B 2><"-^)= a 

Therefore, g = r 2e . Let be a conjugacy class with p{p) = n — k. Then 

1 ti \ t\ ~n{n- k) +n + (n - kf + (n - k) + 2t(p) - 4 

2^ f( a ) = r 2e(P) - 



creX(p) 

-nk + k 2 - k + 3n - 4 
~ 2n-4 ' 

Since the k-th ring U(e,k) splits into a disjoint union of conjugacy classes X(p) 
with p(p) = n — k, we conclude by (6.4.1) that 

1 „, . -nk + k 2 - k + 3n - 4 

2^ ^ 



\U(e,k)\ ^ J w ~ 2n-4 

1 V ' Jl aeU(e,k) 

as claimed. 

More generally, one can prove that for any function g G K 8 there is a function / 
of type (1.1.1) with symmetric A, such that / = 0, / attains its maximum at the 
identity and the central projection of / is g. 

(10.5) Remark. Scarcity of relatively good values. Let us consider the function / of 
Example 4.2. We observe that 



e 



/ = a-iri + a 2 r 2 
for 

n 2 — nm — An + 3m + 3 4n — 2m — 6 

cti = o an d 012 



n 2 — nm + m — 3 n 2 — nm + m — 3 

Thus / is a convex combination of r\ and r 2 e, hence 1 = /(e) > f(o~) for all a E S n 
and / = 0. Remark 7.6 implies that / is a generalized function (1.3.1) of the 
required type. 
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11. The General Case. Proofs 



In this Section, we prove Theorem 5.1 and describe the "extreme" distributions. 
Let us choose a convenient basis in spa,n{x n -i,i, Xn-2,2, Xn-2,1,1}: 

9i = Xn-1,1 =P~ 1, 92 = Xn-2,2 + Xn-2,1,1 + 3Xn-i,i = P 2 - 2 and 

93 = Xn-2,1,1 - Xn-2,2 = 1-2*. 

(11.1) Definition. Let K be the set of all functions g G span{<7i, #2, #3} such that 
9(e) > g(a) for all a G >S n . We call K the central cone. 

Identifying span{<7i, #2, #3} with a 3-dimensional vector space IR 3 , we see that 
conditions g(e) > g(a) define the central cone K as a convex polyhedral cone in 
IR 3 . The condition g(e) = 1 defines a plane H in R 3 and the intersection B = HC\K 
is a 6ase of that is, a polygon such that every g E K can be uniquely represented 
in the form g = Xh for some h G B. 

Our goal is to determine the structure of K. This is somewhat more complicated 
than in the 2-dimensional situations of Sections 9-10. 

(11.2) Proposition. Let us define functions 



7*1 = 


—np + n + p 2 — 2 


n-2 


T2 = 


1 - 2t, 


rz = 


2np - 3p - 2n - p 2 - 2t + 6 


n 2 — 5n + 6 


r 4 = 


p + 2t - 2 

ana 

n-2 


r 5o = 


-2np + 3p 2 - 3p + 2tn + n - 3 


n 2 - 2n - 3 



Then 

(1) If e G S n is the identity, then 

ri(e) = r 2 (e) = r 3 (e) = r 4 (e) = r 5o (e) = 1; 

(2) If n is even then r\,r2,rs and r 4 are the vertices (in consecutive order) 
of the planar quadrilateral B = convjri, r2, r^, r 4 } which is a base of the 
central cone K; 

(3) If n is odd then ri,r2,r3,r 4 and r^ are the vertices (in consecutive order) 
of the planar pentagon B = convjri, ri, rs, r 4 , rs } which is a base of the 
central cone K. 
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Proof. A function g e spanjgi, (727 <73} can be written as a linear combination g = 
ctigi + oliQi + &393- Then (7(e) = ai(n — 1) + a^^ 2 — 2) — a 3 and the conditions 
#(e) > g(a) are written as 

ai(n- 1) +« 2 (n 2 - 2) + a 3 > a x (p{a) - l) + a 2 (p 2 (a) -2) + a 3 (l - 2t(a)), 

which, for a 7^ e are equivalent to 

, , ,v 2t(a) 

ol\ + a 2 (n + p(cr)) + a 3 -— > 0. 

v ' n — p(a) 

Applying Lemma 9.2, we see that for even n, the system is equivalent to 

cki + na 2 > 

ai + (2n - 3)a 2 > 

(11.2.1) 

v y ai + (2n - 2)a 2 + a 3 > 

CKi + B«2 + «3 > 

whereas for odd n, the system is equivalent to 

ai + na 2 > 

ai + (2n - 3)a 2 > 
(11.2.2) ai + {2n - 2)a 2 + a 3 > 

«i + (n + l)a 2 + a 3 > 

nai + n 2 a 2 + (n — 3)a 3 > 0. 

The set of all feasible 3-tuples (cti, a 2 , 013) is a polyhedral cone, which, for even n, 
has at most 4 extreme rays and for odd n has at most 5 extreme rays. We call an 
inequality of (11.2.1)-(11.2.2) active on a particular tuple if it holds with equality. 

It is readily verified that for even n the following tuples span the extreme rays 
of the set of solutions to (11.2.1): 

(—n, 1, 0) 4th and 1st inequalities are active 

(0, 0, l) 1st and 2nd inequalities are active 

(2n — 3, —1, l) 2nd and 3d inequalities are active 

(l, 0, — l) 3d and 4th inequalities are active 

and that for odd n the following tuples span the extreme rays of the set of solutions 
to (11.2.1): 

(—71, 1, 0) 5th and 1st inequalities are active 

(0, 0, l) 1st and 2nd inequalities are active 

(2n — 3, —1, l) 2nd and 3d inequalities are active 

(l, 0, — l) 3d and 4th inequalities are active 

(— 2n — 3, 3, — n) 4th and 5th inequalities are active 
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We obtain r\,r 2 ,rz-,r± and r 5o by scaling the corresponding linear combinations 
<y.\g\ + Q-292 + &393 so that the value at the identity is equal to 1 and hence 
r i, r 2, 7*3, 7*4 and r 5o lie on the same plane in spanjgi, g 2 , (73}. □ 



(11.3) Remark. One can observe that if n is even then r 5o ^ if, for if a is a 
product of n/2 commuting transpositions, so that p(cr) = and t(a) = n/2, then 
r 5 o(v) = (n 2 + n - 3)/(n 2 - 2n - 3) > 1 = r 5o (e). 



The base of the central cone 





5o 



n is even 



« is odd 



Figure 5 



We observe that function r% coincides with function v\ of Lemma 10.2 (the sym- 
metric QAP) and that function r 2 coincides with function r\ of Lemma 9.3 (the 
pure QAP). Function r 4 has a bullseye type distribution (see Section 2.2) whereas 
r\ is a sharp spike (see Section 4.1). We have r\{a) = 1 if and only if a = e or 
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dist(e, a) = n and r\(a) < for all other a. 



Distribution of values of the objective function 
with respect to the Hamming distance 
from the maximum point 




Maximum value 



Figure 6 



Below average value 



Function r^ Q resembles a spike, but diluted. 

Now we are getting ready to prove Theorem 5.1. 

(11.4) Lemma. Let g G spanjgi, g 2 , #3} be a function such that g(e) = 1. For a 
2 < k < n — 2, let a k be a permutation such that p(ak) = k and t(o"fc) = 0, let n be 
a permutation such that p{rf) = and t(rj) = 1 and let 9 be permutation such that 
p(9) = t{6) = 0. Then 



max{g(a k ),g(n),g(e)} > 



k-2 



rr 



kn + k-2 



Proof. We can write 



9 = OLi- 



p — 1 p 2 — 2 
+ a 2 



+ a 3 (l-2£) 



n-1 ' ~~*n 2 -2 
for some a±, a 2 and a 3 such that ol\ + a 2 + 0:3 = 1. Then 

k — 1 k 2 — 2 

g(a k ) = «i + a 2 ^ — - + a 3 

n — 1 n z — 2 

«i 2 

gyn) = 7 - "2— — ~ - «3 

n — 1 — 2 

ff(fi) = -^l- a2 ^2 +a3, 
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We observe that g(<Jk), g(v) and g(0) are linear functions of 0:1,0:2 and 03 and 
hence 

^(01,02,03) = max{sr((7 fc ), #(??), #(#)} 

is a convex function on the plane 01 + 02 + 03 = 1. 
Moreover, for 

fii a ,\ fe(l-n) n 2 -2 

« 1 = n 2 -nk + k-2 > a2= n*-nk + k-2 ^ " 3 = ° 

we have 

k-2 



(11.4.2) g(a k ) = g( V ) = g(6) 



n 2 — nk + k — 2 



Let us prove that the minimum of £(01, 02, 03) on the plane 01 + 02 + 03 = 1 is 
attained at (11.4.1). Let 

n 2 — 2n n 2 — nk n 2 — kn — 2n + 2k 
Xl = W^2k> X2 = ^}-4 and As = 2k • 

Then 

><ig(o- k ) + \ 2 g(n) + \ 3 g(9) = ax + a 2 + a 3 = 1 and Ai,A 2 ,A 3 >0. 

Comparing this with (11.4.2), we conclude that there is no point (01,02,03) with 
01 + 02 + 03 = 1 such that 

k — 2 

g(°k),g(v),9(0) < 



n 2 — nk + k — 2 

□ 

Now we are ready to prove Theorem 5.1. 

Proof of Theorem 5.1. Without loss of generality, we may assume that the maxi- 
mum value of fo is attained at the identity permutation e. Excluding an obvious 
case of fo = 0, by scaling /, if necessary, we may assume that fo(e) = 1. Let g be 
the central projection of fo. By Lemma 6.2, g(e) = fo(e) = 1 > g(a) for all a G S n 
and g = 0. By Proposition 7.5, g must be a linear combination of the functions 
Xn-, Xn-1,1, Xn-2,2 and Xn-2,i,i- Since g = 0, g is a linear combination of non-trivial 
characters Xn-1,1, Xn-2,2 and Xn-2,1,1 alone. Therefore, g lies in the central cone: 
g G K, see Definition 11.1. 

Let X k be the set of all permutations a such that p(a) = k and t(o~) = 0. As in 
the proof of Theorem 4.1, we conclude that 

1 5 k\ 
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Let Y be the set of all permutations a such that p(a) = and t(a) = 1. To choose 



a permutation a £ Y, one has to choose a transpositions in y^J ways and then an 

arbitrary permutation of the remaining (n — 2) symbols without fixed points and 
2-cycles. Using (6.4.2), we estimate 

In' 1 

Let us choose a permutation G Xk, a permutation rj & Y and a permutation 
# G Xq. Let us choose Z to be one of X^, Xq and Y, depending where the maximum 
value of g(crk), g{rj) or g{6) is attained. Hence 

\z\ > ±. 

1 1 - 5fc! 

The set Z is a disjoint union of some conjugacy classes X(p) and for each X(p) by 
(6.4.1) and Lemma 11.4, we have 



aeX(p) 



and hence 



1 ^ fc - 2 

]Z| ^ /o(a) " n*-kn + k-2- 

1 1 <r€^ 



Applying Lemma 6.3 with X = Z and (3 = (3(n, k), we conclude that 

(l- 7 )/3(n,fc) 



'{aeS n :f (a)> 1 /3(n,k)}> 



5k\ 

for all re > 5. □ 
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